KMID : 1022420110030020065
Phonetics and Speech Sciences, 2011, Vol. 3, No. 2, pp. 65-70
Improvement of Rejection Performance using the Lip Image and the PSO-NCM Optimization in Noisy Environment
Kim Byoung-Don
Choi Seung-Ho
Abstract
Audio-visual speech recognition (AVSR) has recently been studied as a way to cope with noise in speech recognition. In this paper, we propose a novel method for determining the weighting factors used in audio-visual information fusion: we apply particle swarm optimization (PSO) to select them. AVSR experiments show that PSO-based normalized confidence measures (NCM) improve the rejection performance for misrecognized words by 33%.
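The idea of searching for a fusion weight with PSO can be illustrated with a minimal sketch. The abstract does not give the NCM formula, so everything here is an assumption made for illustration: the linear fusion in `fuse`, the fixed acceptance threshold, the cost function `rejection_error`, the PSO coefficients, and the toy data from `make_toy_samples` are all hypothetical and are not taken from the paper.

```python
import random


def fuse(audio_conf, visual_conf, w):
    """Weighted linear fusion of two confidence scores (assumed form)."""
    return w * audio_conf + (1.0 - w) * visual_conf


def rejection_error(w, samples, threshold=0.5):
    """Fraction of wrong accept/reject decisions for fusion weight w.

    samples: list of (audio_conf, visual_conf, is_correct) tuples.
    A word is accepted when its fused score reaches the threshold.
    """
    errors = 0
    for audio_conf, visual_conf, is_correct in samples:
        accepted = fuse(audio_conf, visual_conf, w) >= threshold
        if accepted != is_correct:
            errors += 1
    return errors / len(samples)


def pso_minimize(cost, lo=0.0, hi=1.0, n_particles=20, n_iters=50,
                 inertia=0.7, c1=1.5, c2=1.5, seed=0):
    """Basic one-dimensional PSO; returns (best_position, best_cost)."""
    rng = random.Random(seed)
    pos = [rng.uniform(lo, hi) for _ in range(n_particles)]
    vel = [0.0] * n_particles
    pbest = pos[:]                         # best position seen per particle
    pbest_cost = [cost(p) for p in pos]
    g = min(range(n_particles), key=lambda i: pbest_cost[i])
    gbest, gbest_cost = pbest[g], pbest_cost[g]   # best position overall
    for _ in range(n_iters):
        for i in range(n_particles):
            r1, r2 = rng.random(), rng.random()
            # Velocity: inertia plus pulls toward personal and global bests.
            vel[i] = (inertia * vel[i]
                      + c1 * r1 * (pbest[i] - pos[i])
                      + c2 * r2 * (gbest - pos[i]))
            # Keep the weight inside [lo, hi].
            pos[i] = min(hi, max(lo, pos[i] + vel[i]))
            c = cost(pos[i])
            if c < pbest_cost[i]:
                pbest[i], pbest_cost[i] = pos[i], c
                if c < gbest_cost:
                    gbest, gbest_cost = pos[i], c
    return gbest, gbest_cost


def make_toy_samples(n=200, seed=1):
    """Hypothetical data: noisy audio scores, somewhat cleaner visual scores."""
    rng = random.Random(seed)
    samples = []
    for _ in range(n):
        is_correct = rng.random() < 0.5
        center = 0.7 if is_correct else 0.3
        audio_conf = min(1.0, max(0.0, rng.gauss(center, 0.30)))
        visual_conf = min(1.0, max(0.0, rng.gauss(center, 0.15)))
        samples.append((audio_conf, visual_conf, is_correct))
    return samples


if __name__ == "__main__":
    data = make_toy_samples()
    w, err = pso_minimize(lambda x: rejection_error(x, data))
    print(f"fusion weight {w:.3f}, rejection error {err:.3f}")
```

In this sketch each particle is a single candidate audio weight in [0, 1]; a fuller system would likely search over several weights at once, which only requires making the positions and velocities vectors.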
Keywords
audio-visual speech recognition, particle swarm optimization, normalized confidence measure, rejection performance